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ABSTRACT 

Smith  (1936)  suggested  a  method  that  can  be  used  for  setting 
confidence  limits  on  linear  combinations  of  variances.  This  method 
was  studied  and  expanded  by  Satterthwaite  (1941,  1946)  and  has  be¬ 
come  known  as  Satterthwaite ' s  procedure.  The  procedure  has  been 
widely  used  for  the  past  40  years .  In  this  paper  a  new  procedure 
is  proposed  for  this  problem  that  is  better  than  Satterthwaite 's 
procedure  and  very  easy  to  compute  from  existing  tables. 
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1.  INTRODUCTION 

Let  for  i  “  1,  2,  . . . ,  K  be  independently  distributed 

as  chi-square  random  variables  with  degrees  of  freedom  respec¬ 
tively.  A  problem  which  occurs  frequently  in  statistical  applica¬ 
tions  is  that  of  placing  confidence  intervals  on  linear  combinations 
of  61.  For  example,  consider  the  one-factor  nested  components-of- 
variance  model  vith  equal  numbers  in  the  subclasses  given  by 

«  u  +  A^  +•  where  A^,  e^j  are  jointly  independent,  the  A^  are 
distributed  N(0,  a2)  and  the  e  ,  are  distributed  N(0,  a2)  for  i  -  1, 

A  lj  6 

2,  ....  I;  j  m  1,  2,  ....  J.  A  confidence  interval  on  a2  +  a2,  the 

a  e 

total  variations,  is  frequently  desired.  If  S2  and  S2  denote  respec¬ 
tively  the  Among  and  Within  mean  squares  with  degrees  of  freedom 


n^  and  n^,  and  if  the  expected  mean  squares  are  denoted  by  6^  and  6^ 


°A  +  ae  “  J  91  +  *T  ®2 


is  of  the  form  of  a  posit ive  Linear  combination  of  6^  and  Qj-  Also 

nonnegative  lfxtear  combination  of  the  variance  components  o2  and  o2 

A  e 

given  by  ao2  +  bo2  may  be  desired  where  a  and  b  are  given  constants 
A  e 
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such  that  a  >  0  and  b  >0.  Since  c0,  +  0„  ■  c(o2  +  Jo2)  +  o2  ■ 

1  2  e  A  e 

cJ(o2  +  (c+l)o2/cJ),  a  confidence  interval  on  c0^  +0^  with  c  >  0  is 
equivalent  to  a  confidence  interval  on  ao2  +  bo2  for  any  specified 
a  ^  0,  b  >  0  that  satisfies  a/J  <  b  <  •. 

For  another  example  consider  a  K  -  factor  nested  components-of- 
variance  model  with  equal  numbers  in  all  subclasses  (balanced,  com¬ 
plete  model) .  The  total  variation  is  a  positive  linear  combination 
of  the  expected  mean  squares.  Details  when  K  ■  3  and  K  *  4  can  be 
found  in  several  textbooks  (e.g.,  Graybill  1976,  Chapter  15). 

For  a  final  example  consider  a  two-way  crossed  classification 
components-of-variance  model  with  one  number  in  each  subclass  given 
by  Yjj  ■  p  +  +  Bj  +  E^.  Let  SA»  S^,  represent  the  A,  B,  and 

E  mean  squares  and  0^,  0^,  0 ^  represent  the  corresponding  expected  mean 

squares.  A  quantity  that  is  sometimes  needed  in  this  problem  is  the 
total  variation,  a2  +  <*g  +  The  quantity  a2  +  +  a\  is  a  positive 

linear  combination  of  the  8^.  Many  other  problems  could  be  stated  in 
which  a  positive  linear  combination  of  0^  is  required. 

There  are  no  exact  (the  word  "exact"  means  exact  specified  confi¬ 
dence  coefficient)  confidence  intervals  available  for  nonnegative  linear 
combinations  of  the  0^.  Smith  (1936)  defined  an  estimate  of  a 
linear  function  of  variances  to  be  a  linear  function  of  independent  mean 
squares  and  proposed  approximating  the  distribution  of  such  an  estimate 
by  a  chi-square  distribution  whose  degrees  of  freedom  are  determined  by 
equating  the  variance  of  the  estimate  to  the  variance  of  the  approximating 
(chi-square)  random  variable.  From  this  distribution  one  can  obtain 
approximate  confidence  intervals  on  linear  functions  of  variances.  Satterth- 
waite  (1941,  1946)  studied  this  approximation  and  it  has  become  known  as  the 
Satterthwaite  procedure.  Welch  (1956)  exhibited  a  series  approximation. 
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'  . . . .  " . 

1 

analogous  to  the  Cornish- Fisher  expansion,  for  the  general  problem  of 

finding  confidence  limits  for  linear  combinations  of  several  variances. 

Huitson  (1955)  also  gave  a  method  for  setting  confidence  intervals  on 

linear  combinations  of  variances.  He  arrives  at  some  of  the  methods 

presented  by  Welch,  although  the  details  of  their  derivations  differ 

considerably.  Huitson  includes  a  special  set  of  tables  which  must  be 

used  to  obtain  the  confidence  intervals.  Flelss  (1971)  discusses  the 

Satterthwaite  and  Welch  methods  for  setting  confidence  limits  on  a?  +  a2 

A  e 

for  the  two  factor  cross  component-of-variance  model  and  arrives  at  the 

•> 

conclusion  that  Welch's  method  is  adequate  (and  better  than  Satterthwaite ' s 
method).  Fleiss  only  evaluates  the  cases  where  n^  *  2n^.  However,  when  n^ 
is  large  relative  to  n^,  Welch's  procedure  may  not  be  very  good.  This  is 
demonstrated  in  Table  1.  This  table  was  obtained  by  numerical  integration 
and  the  entries  are  the  ranges  over  which  the  confidence  coefficients  vary 
as  the  unknown  parameter  p  »  cQ^/ (cB^  +  &2>  varies  from  0  to  1.  The  nominal 
confidence  coefficient  is  l  -  a  «  .95.  In  component-of-variance  models  in 
applied  problems  it  is  often  the  case  that  n2  is  much  larger  than  n^  so  the 
conclusions  given  by  Fleiss  may  not  apply  in  those  cases. 

Burdick  and  Sielken  (1978)  propose  a  method  for  constructing  exact  size 
confidence  intervals  for  this  problem,  but  the  expected  lengths  of  their 
intervals  are  extremely  bad.  They  are  sometimes  more  than  800%  larger  than 
the  expected  widths  given  by  Satterthwaite,  so  their  method  cannot  be  re¬ 
commended  for  the  problem  discussed  in  this  paper. 

The  purpose  of  this  paper  is  to  propose  and  evaluate  a  method,  called 

the  Modified  Large  Sample  (MLS)  method,  for  obtaining  confidence  intervals 
K 

on  8  •  E  c,8  with  nonnegative  constants  c, .  The  procedure  proposed  here 
i-1  1  1  1 

is  compared  to  those  of  Satterthwaite  and  Welch. 
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2.  THE  PROPOSED  METHOD 


In  this  section  we  derive  a  method  for  setting  confidence  ln- 
K 

tervals  on  0  *  l  c.0..  To  Illustrate  the  method  we  first  discuss  It 
1-1  1 

for  a  linear  combination  of  two  variances,  i.e.  for  9  -  c0^  +  0^. 

The  UMVU  estimator  0  of  0  is  cS^  +  S2,  and  var[0]  -  c2(202/n^) 

+  2Q^/n^.  Thus  Z  -  (0-0)/  y^var 10]  has  a  limiting  normal  distribution 
with  mean  zero  and  variance  one  as  minO^,^)  -*•  •.  Using  these  re¬ 
sults  an  approximate  1  -  a  confidence  interval  on  0  is  given  by 
cS2  +  S2  -  Na\/c2(202/  nx  )  +  202/n2  £  0  <  cS2  +  S2  +  No\/c2(202/  n^  )  +  202/n~ 

where  is  the  upper  a  probability  point  of  a  standard  normal  p.d.f. 

To  utilize  these  limits,  we  replace  02  and  9 2  by  Sj  and  re¬ 
spectively.  We  then  modify  the  confidence  limits  so  they  might  be 
more  exact  for  small  or  moderate  sample  sizes  by  replacing  the  con¬ 
stants  -N^,  N^,  2/n^,  2/nj  by  general  constants  and  obtain  the  follow¬ 
ing  for  the  approximate  1  -  a  confidence  interval  on  0 
cS2  +  S2  -\/L2c2Sj  +  L2Sj  <  0  <  cSj  +S2  +  \/H2c2Sj  +  H2Sj.  (2.1) 

We  now  determine  L^,  by  forcing  the  confidence  interval 

to  have  an  exact  confidence  coefficient  1  -  a  when  0^-0  and  when 
02  -  0.  When  0^  -  0  it  follows  that  S2  -  0  with  probability  one  so 
we  obtain  L.  -  1  -  1/F  ,  H.  -  1/F  -  1  for  i  -  1, 

i  «n:  ni»  i  °12:  V  * 

2  where  F^,  m  n  uPPer  Y  probability  point  of  Snedecor's  F 

distribution  with  m  degrees  of  freedom  in  the  numerator  and  n  de¬ 
grees  of  freedom  in  the  denominator.  Also  >  0,  >  0* 

■  1-  The  resulting  confidence  interval  on  c0^  +  6^,  called 
the  Modified  Large  Sample  (MLS)  confidence  interval,  is  in  (2.1). 
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The  can  be  chosen  so  that  when  0^*0  for  either  i  ■  1  or  i  ■  2 
the  resulting  confidence  interval  satisfies  one  of  the  following  three 
conditions.  (1)  "Equal  tails"  confidence  intervals  (we  denote  this 
method  by  MLS1  and  in  this  case  a^  ■  a/2  and  ■  1  -  a/2  for  1*1, 
2);  (2)  "Shortest  unbiased"  confidence  Intervals  (we  denote  this  method 
by  MLS 2  and  for  values  of  see  John  (1973));  (3)  "Shortest"  con¬ 

fidence  Intervals  (we  denote  this  method  by  MLS3  and  for  values  L^, 

Ht  see  Tate  and  Klett  (1959)). 

Note  that  the  confidence  interval  in  (2.1)  is  also  exact  when 
-*•  •  and  n2  is  fixed,  or  when  +  “  and  n^  is  fixed. 

To  generalize  the  MLS  procedure  to  nonnegative  linear  combinations 
of  K  variances,  we  proceed  as  follows: 

a)  -  n1s2/01  are  independent  chi-square  random  variables  with 

n.  degrees  of  freedom  for  i  ■  1,  2,  . . . ,  K. 

K 

b)  define  S  by  0  «  l  c.6  where  c  ^  0,  cv  -  1; 

i-1  1  * 

c)  an  approximate  1  -  a  confidence  interval  on  9  is 

EciSi  “^ILiciSi  -  0  -  EciSi  +l/5:HiciSi  (2.2) 

where  L.  ■  1  -  1/F  ;  H.  ■  1/F  -  1  where  a.,  >0, 

1  ail:  ni’  "  1  ai2:  ni’  "  11 

*12  >  °*  »il  +  a^2  *  1  for  i  *  1,  2,  ...,  K.  The  can  be  chosen 

for  equal  tails,  for  shortest,  or  for  shortest  unbiased 

confidence  intervals  when  K  -  1  of  the  0^  are  zero.  The  confidence 

interval  in  (2.2)  is  exact  when  (1)  any  K  -  1  of  the  6^  are  zero; 

(2)  when  any  K  -  1  of  the  n^  -*  <*>.  When  any  M  of  the  0^  *  0  for 

M  <  K  the  resulting  confidence  interval  reduces  to  the  MLS  confidence 

Interval  for  a  nonnegative  linear  combination  of  the  remaining  K  -  M 

variances  0^;  also  when  any  M  of  the  n^  for  M  <  K  the  resulting 
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confidence  interval  reduces  to  the  MLS  confidence  interval  for  a  non- 
negative  linear  combination  of  the  remaining  K  -  M  variances  0^. 

3.  THE  SATTERTHWAITE,  WELCH,  AND  MODIFIED  LARGE  SAMPLE  METHODS 
Let  Z  ■  (cS^  +  S^)/(c0^  +  0^)  »  pU^/n^  +  (l-p)!^/^  where  c  ^  0 
P  ■  c0^/(c0^  +  ©2),  and  and  Uj  are  independently  distributed  as 
chi-square  random  variables  with  n^  and  ^  degrees  of  freedom  respec¬ 
tively.  To  determine  the  distribution  of  Z  we  use  the  following 
theorem  (see  Fleiss  (1971)). 

U2 

Theorem  1.  The  distribution  of  Z  conditional  on  W  *  — — -----  *  w  is 

Ui  U2 

that  of  k(U^  +  U2),  where  k  ■  p(l-w)/n^  +  (l-p)w/n2  and  U^,  U2  are 
independent  chi-square  random  variables  with  n^  and  degrees  of 
freedom  respectively. 

The  following  corollary  can  be  used  to  evaluate  the  probability 
coverages  of  confidence  intervals  on  cO^^  +  02  for  the  Satterthwaite, 
Welch,  and  MLS  methods. 

Corollary  1.  Let  z(W)  be  a  function  of  W.  Then  P [Z  z(W)l  is 

PIZ  <,  z(W) ]  -  f^H  (z(w)/ [p(l-w)/n.  +  (l-p)w/n«] )p(w)dw, 

0  12  1  L 

x  n  /2-1  n  /2-1 

where  p(w)  -  jj — p  w  (1-w)  is  the  p.d.f.  of  W 

and  Hn(.)  is  the  c.d.f.  of  a  chi-square  random  variable  with  n  de¬ 
grees  of  freedom. 


-6- 


The  confidence  limits  of  the  Satterthwaite,  Welch  and  MLS 
methods  are  functions  of  W  so  this  corollary  and  numerical  integra¬ 
tion  can  be  used  to  evaluate  the  confidence  coefficients  as  a  function 
of  the  unknown  parameter  p  and  specified  values  of  n^  and  n^> 

4.  EVALUATION  OF  CONFIDENCE  COEFFICIENTS  AND  EXPECTED  WIDTHS 
It  is  seen  that  the  probability  coverages  associated  with  the 
Satterthwaite,  Welch,  and  MLS  approximate  confidence  intervals  for 
c0i  +  depend  on  c  and  the  population  parameters  0^,  0^  only  through 
the  unknown  parameter  p  defined  by  p  ■  c0^/(c0^  +  @2)*  Clearly 
0  p  <_  1.  Simpson's  rule  with  interval  size  h  *  0.01  was  used  to 
evaluate  the  integral  (the  probability)  in  Corollary  1  for  the  dif¬ 
ferent  functions  z(w)  given  by  the  Satterthwaite,  Welch,  and  MLS 
methods.  The  IMSL  subroutine  MDCH  was  used  to  compute  the  chi- 
square  distribution.  The  values  of  p  used  were  p  *  0.0  (0.1)  1.0; 
all  combinations  of  the  following  values  of  n^  and  n2  were  examined 
for  l-o  equal  to  .90  and  .95. 

nx:  4,  5,  6,  7,  8,  9,  10,  15,  20,  30 

n2:  4,  5,  6,  7,  8,  9,  10,  15,  20,  30 

Tables  2  and  3  contain  some  of  the  results.  The  entries  are  the  ranges 
that  the  confidence  coefficients  vary  as  the  unknown  parameter  p  varies 
in  the  set  0.0  (  .1)  1.0.  The  column  headed  MLS1  is  for  "equal  tails" 
confidence  intervals;  the  column  headed  MLS2  is  for  "shortest  unbiased" 

confidence  intervals;  the  column  headed  MLS3  is  for  "shortest"  confidence 

intervals.  These  are  defined  in  Section  2. 
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To  evaluate  the  expected  lengths  a  simulation  study  was  con¬ 
ducted.  One  thousand  chi-square  random  numbers  were  generated  using 
the  IMSL  subroutine  GGCSS  (chi-square  random  deviate  generator)  for 


each  pair  of  values  of 

nl 

and 

n2 

listed 

below 

nl 

4, 

4, 

4, 

8, 

8, 

16 

n2 

4, 

8, 

30, 

8, 

30 

,  32 

From  these  random  numbers  the  three  ratios,  r^,  an<i  r^  were  eval“ 

uated  for  1  -  a  *  .90  and  1  -  a  ■  .95  where 

Average  length  of  MLSi  confidence  interval 
Average  length  of  Welch  confidence  interval  i 

The  results  are  recorded  in  Tables  4  and  5  broken  down  for  p  =  0.0(. 2)1.0. 

The  ratios  depend  on  c,  9^  and  02  only  through  the  parameter  p. 

Some  conclusions  from  the  formulas  are  as  follows: 

(1)  The  results  are  for  all  values  of  ciO. 

(2)  Only  the  MLS  methods  give  correct  asymptotic  results  for 
large  n^  and  small  n^  for  i  ^  j . 

(3)  When  p  *•  0  only  the  Satterthwaite  and  MLS  methods  are  exact. 

(4)  When  p  ■  1  only  the  Satterthwaite  and  MLS  methods  are  exact. 

(5)  The  MLS  methods  are  easy  to  compute  even  for  nonnegative  linear 
combinations  of  K  variances. 

Some  conclusions  from  the  tables  are  as  follows: 
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(1)  The  confidence  coefficients  for  the  Welch  method  are  closer 
to  the  nominal  values  than  the  Satterthwaite  method  but  the 
Welch  method  is  more  difficult  to  compute. 

(2)  The  confidence  coefficients  for  the  Welch  and  Satterthwaite 
methods  can  fall  several  points  below  the  nominal  level. 

This  is  undesirable. 

(3)  The  confidence  coefficients  for  the  MLS  methods  appear  to  be 
greater  than  or  equal  to  the  nominal  values. 

(4)  The  MLS2  and  MLS3  methods  give  confidence  intervals  whose 
average  widths  are  generally  smaller  (and  sometimes  signifi¬ 
cantly  smaller)  than  the  average  widths  of  the  Welch  method. 

A  very  small  study  was  conducted  for  K  *  3.  An  extension  of  Corol¬ 
lary  1  was  used  to  evaluate  the  confidence  coefficients  for  the  MLS2 
and  Welch  methods.  The  results  are  in  Table  6  where  the  entries  are 
the  ranges  of  the  confidence  coefficients  as  the  unknown  parameters 
vary  in  the  interval  [0,  1) .  The  conclusion  is  that  the  MLS2  method 
is  better  than  the  Welch  method  when  confidence  coefficients  are  com¬ 
pared. 

From  these  conclusions  it  seems  that  the  MLS2  and  MLS3  methods  are 
to  be  preferred  over  the  Welch  or  Satterthwaite  methods  for  computing 
confidence  intervals  on  nonnegative  linear  combination  of  variances. 
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Table  1 

Ranges  of  Confidence  Coefficients  for  Welch  Method 

1  -  a  -  .95 


n2 

Range  of 

Confidence  Coefficients 

100 

.905  -  .956 

100 

.937  -  .952 

Table  2 


Ranges  of  Confidence  Coefficients  (Times  10  ) 
for  Satterthwaite,  Welch  and  MLS  Procedures 


1  -  a  -  0.90 


nl 

n2 

S 

W 

MLS1 

MLS2 

MLS3 

4 

886-924 

889-907 

900-921 

900-929 

900-946 

874-921 

886-907 

900-921 

900-925 

900-942 

868-919 

884-907 

900-920 

900-924 

900-939 

865-918 

882-909 

900-920 

900-923 

900-937 

840-913 

869-919 

900-919 

900-919 

900-929 

5 

886-921 

893-907 

900-917 

900-925 

900-941 

882-920 

891-907 

900-917 

900-923 

900-939 

877-918 

889-906 

900-917 

900-921 

900-936 

875-917 

889-906 

900-917 

900-920 

900-934 

30 

855-912 

879-915 

900-916 

900-917 

900-926 

6 

6 

887-919 

894-907 

900-915 

900-921 

900-937 

8 

883-917 

891-906 

900-915 

900-920 

900-934 

10 

881-916 

892-906 

900-915 

900-919 

900-932 

30 

864-911 

885-910 

900-914 

900-915 

900-924 

8 

8 

890-915 

895-905 

900-912 

900-917 

900-931 

10 

888-914 

894-905 

900-912 

900-916 

900-928 

30 

875T910 

891-905 

900-912 

900-913 

900-918 

10 

10 

892-913 

896-904 

900-910 

900-914 

900-926 

30 

882-908 

894-903 

900-908 

900-911 

900-918 

30 

30 

897-905 

899-901 

900-904 

900-905 

900-910 

Table  3 


•O 

Ranges  of  Confidence  Coefficients  (Times  l(r) 
for  Satterthwaite ,  Welch  and  MLS  Procedures 


1  -  a  -  0.95 


nl 

n2 

S 

W 

MLS1 

MLS2 

MLS3 

4 

4 

939-965 

938-951 

949-961 

950-969 

950-980 

6 

928-963 

936-952 

950-961 

950-967 

950-978 

8 

921-962 

932-953 

950-962 

950-965 

950-976 

10 

918-962 

932-953 

950-962 

950-964 

950-975 

30 

893-958 

919-954 

950-962 

950-963 

950-970 

5 

5 

939-963 

942-952 

949-960 

950-966 

950-977 

6 

935-963 

941-953 

949-960 

950-965 

950-976 

8 

930-961 

938-953 

950-960 

950-964 

950-975 

927-961 

938-953 

950-960 

950-963 

950-974 

30 

907-958 

928-964 

950-960 

950-958 

950-968 

6 

6 

939-962 

944-953 

949-959 

950-963 

950-975 

8 

935-961 

942-953 

947-959 

950-963 

950-973 

10 

932-960 

941-953 

950-958 

950-962 

950-972 

30 

916-957 

934-961 

950-959 

950-958 

950-967 

8 

8 

941-960 

945-953 

950-957 

950-960 

950-971 

10 

939-959 

944-953 

950-957 

950-960 

950-970 

30 

927-956 

941-952 

950-957 

950-958 

950-965 

10 

10 

943-958 

946-953 

950-956 

950-959 

950-969 

30 

934-956 

944-952 

950-955 

950-957 

950-964 

30 

30 

948-953 

949-951 

950-952 

950-953 

950-958 

2- 


Ratios  of  Expected  Length  of  MSL  to  Welch  Confidence  Intervals 


Ratios  of  Expected  Length  of  MLS  to  Welch  Confidence  Intervals  on  cf^  +  6 

1  -  o  *  0.95 


Table  6 


Ranges  of  Confidence  Coefficients  for 
Confidence  Intervals  on 

C101  +  c202  +  03 


1  -  a  -  0.95 


MLS2 

1 

Welch 

.550-. 970 

.930-. 952 

.950-. 969 

.930-. 954 

.950-. 966 

.930-. 954 

.950-. 961 

.944-. 953 
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Smith  (1936)  suggested  a  method  that  can  be  used  for  setting 
confidence  limits  on  linear  combinations  of  variances.  This  method 
was  studied  and  expanded  by  Satterthwaite  (1941,  1946)  and  has  be¬ 
come  known  as  Satterthwaite ' r.  procedure.  The  procedure  has  been 
widely  used  for  the  past  40  years.  In  this  paper  a  new  procedure 
is  proposed  for  this  problem  that  is  better  than  Satterthwaite ' s 
procedure  and  very  easy  to  compute  from  existing  tables. 


